log-bilinear model
log-bilinear language model
The probability of occurrence of the following words under a given context c is
$ P(w|c) = \frac{\exp (\phi(w)\cdot c)}{\sum_{w'\in V} \exp (\phi(w)\cdot c)}
The language model is that $ \phi is an embedding function for the appropriate word
In A. Mnih and G. Hinton (2007), a little more advanced than mere inner product
https://gyazo.com/4b42aec31043b4fdfd07d353c4ee54ff
https://gyazo.com/12be19af95d085079f65326e8896b110
Log Bilinear Language Model
A. Mnih and G. Hinton (2007)
"Three new graphical models for statistical language modelling",
A. Mnih, Y. Zhang, and G. Hinton (2009)
"Improving a statistical language model through non-linear prediction",
Neurocomputing, vol. 72, no. 7-9, pp. 1414 – 1418
---
This page is auto-translated from /nishio/対数双線形モデル. If you looks something interesting but the auto-translated English is not good enough to understand it, feel free to let me know at @nishio_en. I'm very happy to spread my thought to non-Japanese readers.